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It has recently been shown that there are substantial differences 
in the regularity behavior of the empirical process based on scalar 
diffusions as compared to the classical empirical process, due to the 
existence of diffusion local time. Besides establishing strong parallels 
to classical theory such as Ossiander's bracketing CLT and the gene- 
ral Gine-Zinn CLT for uniformly bounded families of functions, we 
find increased regularity also for multivariate ergodic diffusions, as- 
suming that the invariant measure is finite with Lebesgue density n. 
The effect is diminishing for growing dimension but always present. 
The fine differences to the classical iid setting are worked out using 
exponential inequalities for martingales and additive functionals of 
continuous Markov processes as well as the characterization of the 
sample path behavior of Gaussian processes by means of the generic 
chaining bound. To uncover the phenomenon, we study a smoothed 
version of the empirical diffusion process. It turns out that uniform 
weak convergence of the smoothed empirical diffusion process under 
necessary and sufficient conditions can take place with even expo- 
nentially small bandwidth in dimension d = 2, and with strongly 
undersmoothing bandwidth choice for parameters /3 > d/2 in case 
d > 3, assuming that the coordinates of drift and diffusion coefficient 
belong to some Holder ball with parameter /?. 



1. Introduction. Let X be a stationary, ergodic diffusion process on E C K d 
with invariant probability measure fi, and denote its infinitesimal generator on 
L 2 (E, d/z) by A. We refer to Section 2 for precise definitions. The functional central 
limit theorem for stationary ergodic Markov processes due to Bhattacharya (1982) 
states that for any fixed t > and any fixed function / of the form / = Ag, 



where cr 2 (f ) := —2 f„ f(x)g(x)[i(dx) and W = (W t )t>o denotes a standard Wiener 
process. In particular, 




(n — > oo) 




(n — > oo). 
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The passage to a continuous-time result is obvious. Given any finite set of func- 
tions fx, ... , f m solving the Poisson equation = Agi, i = 1, . . . , m, the law of the 
m-dimensional process with components i -1 / 2 f Q Agi(X u )du, i — l,...,m, con- 
verges weakly to an m-dimensional centered Gaussian distribution with asymptotic 
covariances given by 



-/ Ag l (x)g j (x)^(dx) - / g i (x)Ag j (x)fi(dx), i,j = 1, . . . , m, 

JE J E 

by means of the Cramer- Wold device. 

Classical empirical process theory is concerned with limit results which hold uni- 
formly over entire (possibly infinite-dimensional) classes of functions. The funda- 
mental object of investigation is the classical empirical process in the setting of 
independent, identically distributed random variables X\, ...,X n ~ P 

K assic (f)) f ^ ■■= (-tE(/(*o-E/(W) . 

\ V i=l / }eJ r 

where J 7 is a class of functions, typically in L 2 (dP). Lindeberg's CLT gives conver- 
gence of the finite-dimensional marginals whenever the variance of f{X\) is finite. 
In order to extend this result to a CLT in i°° (J 7 ) which holds uniformly over some 
infinite-dimensional function class, the existence of the limiting Gaussian process 
does not suffice. 

It was shown in van der Vaart and van Zanten (2005) that the empirical process of 
a regular scalar diffusion on an interval I C M with finite speed measure behaves 
substantially different. In fact, weak convergence of the empirical process takes 
place in if and only if the limit exists as a tight, Borel measurable map. 

The proof of this result is heavily based on an analysis of diffusion local time. One 
crucial point is the fact that the empirical measure of a univariate regular diffusion 
is continuous with respect to Lebesgue measure. For dimension d > 2, diffusion 
local time does not exist, and it is not obvious how to derive uniform limit results 
under minimal conditions. In particular, the empirical measure of a multivariate 
diffusion is no longer Lebesgue-continuous. 

Our first results for the so-called empirical multivariate diffusion process 
(G t (/)) /GF := (\ f JiX.Mu 



Vt 



parallel to a large extent results from classical empirical process theory. However, 
we find increased regularity for the empirical diffusion process also in higher di- 
mensions. The effect is diminishing with growing dimension but always present. To 
uncover this phenomenon, we investigate the subsequent modified version of the 
empirical process 

y/tj f(x)n t , h (x)d\(x) = Vt Jj(x) (±J K d«) A(dx), 
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with A denoting the Lebesgue measure. Smoothed empirical processes based on 
iid random variables have been investigated by different authors (see, e.g., Yukich 
(1992) or van der Vaart (1994)). They found recently particular interest due to 
the observation that uniform weak convergence of smoothed empirical processes 
may be deduced even in situations when the original empirical process is not 
tight (Radulovic and Wegkamp (2000, 2003, 2009), Gine and Nickl (2008); see also 
Mendelson and Zinn (2006)). The results however seem to be of limited scope. For 
d = 1 and without further smoothness assumptions on the class J-, Radulovic and Wegkamp 
(2009) require that the bandwidth h n of the kernel estimator satisfies at least 
nh\ — > oo. Under slightly different assumptions, Gine and Nickl (2008) even need 
that nh\ — > oo. The proofs, based on the decomposition given in Theorem 3.2 
in Gine and Zinn (1984), use closeness of the kernel density estimator p n to the 
underlying density p in a mean-squared sense, where p is assumed to be at least 
diffcrcntiable already in the univariate situation. Our proof relies on a completely 
different approach, based on martingale approximation and theory of Markovian 
semigroups. We do not use explicitly the closeness of wt,h to the Lebesgue density 
7r of the invariant measure in a mean squared sense. It turns out that uniform 
weak convergence of the smoothed empirical diffusion process under necessary and 
sufficient conditions can take place with even exponentially small bandwidth in 
dimension d = 2, and with strongly undersmoothing bandwidth choice for parame- 
ters f3 > d/2 in case d > 3, assuming that the coordinates of drift and diffusion 
coefficient belong to some Holder ball with parameter (3. Maybe surprisingly, the 
performance of the smoothed empirical diffusion process can be guaranteed even if 
the bandwidth is too small for ensuring consistency of 7?^. 



2. Preliminaries. 



2.1. Notation and definitions. Let (E,B(E)), E c R d , be a Borel measurable 
space, and let X = (AT t ) t > be an i?-valued Markov process with invariant proba- 
bility measure \x. Its transition semigroup is denoted {Pt) t >o with corresponding 
transition densities p t (-, •). The infinitesimal generator A of (P t ) is defined on the 
domain 

Ptf ~ f 



t 



fJ 



-> for some g€B o ,t^0 



sup 



by Af := lim^o (Ptf ~ f) /t, the limit being taken in sup-norm, where B := {/ G 
Cq ■ \\Ptf — /ll SU p as t —> 0} denotes the center of the semigroup, and C is 
the space of continuous functions / with f(x) — ¥ as x approaches the boundary 
of E. If the transition probabilities admit an invariant probability measure p, on 
(E,B(E)), then (P t ) defines a contraction semigroup on L 2 (E,d/j,) =: L 2 (d[i) (see 
Bhattacharya (1982)). With slight abuse of notation, we denote its infinitesimal 
generator, which is actually an extension of A on L 2 (d/i), also by A, with corres- 
ponding domain TD>a C L 2 (dfi) and range 8% a- Together with the norm || • ||d a , given 
via ||<7||b a := ||<?||L 2 + ll^ffllu 2' tne cou pl e (®>a, II • ||nu) defines a Banach space (see 
Section 4). 
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In case of Ito-Feller diffusions, the generator A acts on Cj-? as a second-order dif- 
ferential operator, that is 

where a and b are a matrix field and a vector field, respectively, on M d such that the 
mappings x i— > a(x) and a; i— > are Borel measurable and locally bounded. C|? 
denotes the space of infinitely often continuously differentiable functions of compact 
support in the interior of E. The matrix a(x) is assumed to be positive definite for 
any x. The "carre du champ" operator on C]((R d ) x C^-(R d ) is defined by 

(2-2) T(g,g) := A(gg) - gAg-gAg = (Vg,aVg). 

Let 1 < p < oo. Subsequently, W m ' p (d/i) denotes the Sobolev space of m-times 
weakly differentiable functions g G L p (dfi) whose weak partial derivatives up to 
order m belong to L p (dfj,), too, equipped with the norm 

\\g\\w^.p(d^ ■= W d w9\\w 

\a\<m 



Here. 

9 S = a — — «I for all a S {0,l,...,m} d , 
and the derivatives are understood in the weak sense. 



Given any initial probability measure /?, let P^(-) := / B P x (-)(3(dx), denoting with 
P x the law of X starting in Xo — x. Then ((X t ) t >o,P ^) is a stationary process. 
Symmetric diffusions play a special role. A semigroup (Pt) t>0 is called symmetric 
with respect to /i, or /i is called reversible with respect to (Pt) 4 > ! if for any f,g £ 
L 2 (d M ) 

r /Ptpd/i - / <?P t /d/i. 



The generator A of a symmetric diffusion is self-adjoint, and it holds J E fAfd/i < 0. 



2.2. Continuity of Gaussian processes. A class of functions T c Ma on E 1 
is defined to be Donsker if there exists a tight Borel-measurable random element 
G of such that Gj G in Here, denotes convergence in law 

of random elements in the generalized sense of Hoffmann- J0rgensen (cf. Dudley 
(1999), Chapter 3). Tightness of G is equivalent to saying that G admits a version 
with almost all sample paths bounded and uniformly continuous on T with respect 
to the pseudo-metric 



(2.3) dl(g, g) := -2 f (g - g)A(g - g)dn, g,ge A^T. 

JE 
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General results on the sample path behavior of Gaussian processes prove useful 
for deriving Donsker theorems under necessary and sufficient conditions. Given a 
separable, infinite-dimensional Hilbert space H, a set C C H is called a GC-set if 
the restriction of an isonormal Gaussian process L on H can be chosen such that its 
sample functions are uniformly continuous on C. A set Q C Cq(P) = {/ G £ 2 (P) : 
P(/) = 0} is pregaussian if and only if the corresponding set in the quotient space 
Lg(P) = {/ € L 2 (P) : P(/) = 0} is a GC-set. 



2.3. Assumptions. We briefly summarize the fundamental assumptions for later 
purposes. 

(I) Poincare's inequality. The carre du champ T is said to satisfy a Poincare in- 
equality on L 2 (d/j.) if there exists some constant c > such that for all / € U>a 



(2.4) Var M (/) 



j fda - (J fdf?\ < c J r(/)d/i, 



where T(/) := r(/, /). For symmetric diffusions, Poincare's inequality is commonly 
referred to as spectral gap inequality since (2.4) is then equivalent to the existence 
of a spectral gap, 

Ai := sup {A > : E x - E = 0} = — > 0, 

cp 

where — / °° XdE\ denotes the spectral decomposition of A, and c — cp appears to 
be the smallest possible constant in (2.4). Furthermore, (2.4) is equivalent to the 
exponential decay of P t to the invariant measure fi in i 2 (d/x) (see, e.g., Theorem 
1.3 in Bakry et al. (2008)), 

Var,, (P t f) < exp f-^\ Var M (/) V / e L 2 (da). 



(II) Bound on the transition density. There exists some Co > such that for any 
u > t > and for any pair of points x, y € M. d , satisfying ||x — < u, we have 

Pt(x,y) < C (t- d / 2 + u™/ 2 ). 

Remark. In case of a constant diffusion coefficient in a stochastic differential equa- 
tion, assumption (II) holds in particular if the drift satisfies the at most linear 
growth condition (Qian and Zheng (2004), Theorem 3.2, with the choice q = 1 + 1 
in their notation). Additional boundedness of the drift then even allows to drop the 
u 3d/2_ term (Q ian ct al (2003), inequality (5)). 

(III) Uniform ellipticity. There exists some positive constant a such that the dif- 
ferential operator A in (2.1) satisfies the uniform ellipticity condition 



> a U\\l for all £ G R d \ {0} and for all y e E. 
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(IV) Symmetry. The semigroup (Pt) of transition operators is symmetric with re- 
spect to fJL. 

(V) Invariant density. The invariant probability measure on (E,B(E)) is Lebesgue 
continuous with density 7r which is bounded and uniformly bounded away from zero 
on any compact subset of the interior of E. 

Assumptions (I) - (V) seem to be rather natural. We briefly illustrate them for an 
example of a diffusion with reflecting boundary conditions. For ease of represen- 
tation, we restrict ourselves to reflecting diffusions on a one-dimensional interval, 
[0, 1], say. Consider the stochastic differential equation 

dX t = b(X t )dt + adWt + v{X t )dl tl 

with X t <G [0,1] for t > 0, W a standard Wiener process and (lt(X)) a non- 
anticipative process that increases only when X t <G {0, 1}, which is part of the 
solution. We assume that b : [0, 1] — >• R is bounded and measurable, a is positive, 
and that the function v satisfies v(§) = 1, 2/(1) = —1. The boundedness of b and 
positivity of a ensure the existence of a weak solution. The invariant measure is 
Lebesgue continuous with density 7r(x)da; ~ <r~ 2 exp(— J 2b(y)/a 2 dy)dx. Due to 
the compactness and the reflecting boundary conditions, the corresponding Markov 
process possesses a spectral gap. The associated operator A is self-adjoint and ellip- 
tic on L 2 (d/Lt) with compact resolvent. Its domain coincides with the subspace 
of W 2,2 (d/i) subject to Neumann boundary conditions. Assumptions (I)-(V) are 
satisfied. In order to avoid potential boundary considerations in the sequel, we re- 
strict attention to compactly supported function classes in the interior of E — even 
if E itself is compact. 

3. Parallels to the classical empirical process. In the sequel we focus 
on the special case of symmetric diffusion processes. Poincare's inequality can be 
seen as a minimal assumption for the following Bernstein-type inequality due to 
Lezaud (2001). Let g be a bounded measurable function with J gd^i = 0. Then 

(3.1) P,(±/p(X,)d,> f ) < ^- 2(g2 + c X| UprM ) ) ¥f ' r>0 ' 

where a 1 is given as 

a 2 := a 2 (g) := lim - f Var Pfl (j g{X u )du^ . 

We now state our first result which parallels Theorem 3.2 in Ginc and Zinn (1984). 
Note that in contrast to the classical empirical process, the symmetrization tech- 
nique is not available for the empirical diffusion process. Thus, the method of our 
proof differs from the proof in Gine and Zinn (1984). While they randomize in the 
asymptotic equicontinuity condition in order to apply the comparison inequality 
due to Fernique (1985), our proof relies on a random decomposition of the function 
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class T and requires in particular the generic chaining bound. Given a function 
class T and any 6 > 0, let 

Fs ■= {f-g- f,9 e F,d G (f,g) < 5} . 



Theorem 3.1. Let ((X t ), P M ) be a stationary, ergodic diffusion satisfying assump- 
tions (I) - (V). Denote by T C 3% a C L\(A[i) a class of uniformly bounded func- 
tions. Then J- is Donsker if and only if it is pregaussian and it holds for any rj > 0. 



(3.2) 



E* 



0. 



(-j/vT) 1 



E* and P* denote outer expectation and probability, respectively. 



Proof. Closedness of A implies that the null space Na ■= {h € E>^ : Ah = 0} 
is a closed subset of L 2 (dfi), that is, for any / g Ma, there exists some unique 
g € BU n Nj[ with Ag = f. For this choice of g, we have in particular 'E fJ _g(Xo) = 0. 
Convergence of the finite-dimensional marginals follows from martingale approxi- 
mation ^ 

±=(g(X t ) - g{X ) - J Ag(X s )ds) 

and the martingale CLT (see Section 4). It is therefore clear that the above con- 
ditions are necessary for T to be Donsker. Hence, it remains to prove asymptotic 
equicontinuity. Denote by (G(/))/ e j^ the limiting process whose sample paths are 
bounded and uniformly continuous with respect to dig. Sudakov's minoration (cf. 
Corollary 3.19 on p. 81 in Ledoux and Talagrand (1991)) implies that 

lim £\/\o% N(s, J", do) = 0. 

For fixed 77 > 0, define 5t :~ (■q/^/t) 1 / 2 and m t := N(St,J 7 ,dG)- Then there exist 
functions /1, . . . , / m , G J- such that dc,(fi, fj) > #t for all 1 < i ^ j < mt and 
\/log mtdt — > 0. Application of the triangle inequality yields the decomposition 

(3.3) sup |G t (/)| <2 sup |G t (/)|+ max \G t (fi-fj)\, 

feT v feT St i<^<™ t 

which goes originally back to Gine and Zinn (1984). We bound the second term in 

(3.3) . For any / e (T - J 7 ), define 

X U := f(X u )du, o) := E^/), cj := ||/|| 8up . 

We consider the decomposition 

2 2 

(3.4) \x f ,t\ = \x ftt \ a{ \x f ,t\ < ^V~t} + \x f>t \ i{ \x f ,t\ > ^Vt). 
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It then holds 
(3.5) P p 
and 

(3-6) P, I 



Lx f!t \t{\x fit \<^Vi}>x\ 

\X f , t \l{\Xf, t \ > ^-V~t}>xj 



< 2 exp 



< 2 exp - 



Thus (cf. the proof of Lemma A.l in van der Vaart (1996)), for some (universal) 
constant K sufficiently large, not depending on c/ and a/, 



x ftt \H\x 



{\x f ,\ < ^Vt} 



t 



\ 



Ka 



f 



< 1, E^i 



X 



f,t 



l{\X,A>%yft} \ 



V 



Kc f /Vi 



where ipp are the Young functions ip p (x) = cxp(x p ) — 1, p — 1,2. Then inequality 
(2.10) in Arcones and Gine (1993) implies for c' := supy g jrC/, 

( „2 ~\ 



E, 



max \X 

l<i^j<mt 



fi-fj,t 



h-fi,t 



> 



2c' 



V~t\ < Klog(m t )^=. 



c h-h 

For estimating the first term in (3.4), we use the generic chaining bound in Theorem 
1.2.6 in Talagrand (2005), i.e. 

C ~2 



E„ 



max \X 



fi-fj,t 



KUX 



< 



'fi-fj 

c h-h 



Vt\<L l2 (T, 



1 1 ■ < 



for some constant L > 0, with 72 denoting the 72-functional (cf. Talagrand (2005), 
Definition 1.2.5). This completes the verification of the asymptotic equicontinuity 
condition, noting that log(m t )/\/t — > as t — > 00 by Sudakov's inequality, while 
72(^*7), d&) — > as 77 — > by pregaussianness. □ 



The above description of the Donsker property characterizes the effect of pregaus- 
sianness on the asymptotic equicontinuity condition but is of little interest for 
applications. Condition (3.2) remains to be verified, and for the latter purpose the 
concept of VC classes does not seem to be suitable for empirical diffusion processes. 
Moreover, the result that a function class satisfying a uniform entropy condition is 
Donsker is proved by means of symmetrization arguments and is equally not easily 
transferred to the diffusion setting. In contrast, an analogue of Ossiander's classical 
bracketing CLT holds. 

Theorem 3.2. Let ((X t ) ,P be a stationary, ergodic Feller diffusion satisfying 
conditions (I) - (V). If T C Lg(d/i) satisfies 

(3.7) J y / logiV [] ( £ ,J-,L 2 (d / i))d £ <(X), 

then T is Donsker. 
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Here N\ 1 (e, J 7 , L 2 (d/x)) denotes the £-entropy with bracketing, that is, the smallest 

number of e-brackets (in L 2 (d^)) which are required to cover T (cf. van der Vaart and Wcllncr 

(1996), Definition 2.1.6). 

Proof. Ergodicity yields that Na is one-dimensional (Bhattacharya (1982), Propo- 
sition 2.2). Since (Pt) is a strongly continuous semigroup on L 2 (d/i) (implying that 
IS a is dense in L 2 (d^)), it follows that Ma = {/ € L 2 (dfi) : j /d/i — 0} by the spec- 
tral gap inequality (see Bhattacharya (1982), Remark 2.3.1). Convergence of the 
finite-dimensional marginals of the empirical process (fit{f))f^j: now follows from 
martingale approximation and the martingale CLT, see (4.4) in Section 4. It remains 
to prove asymptotic equicontinuity. Ossiander's result is about L 2 -bracketing. The 
proof of this classical bracketing CLT as given in Dudley (1999), pp. 239 - 244, is 
based on chaining arguments which are also valid when the pseudo-metric as 
defined in (2.3) is used. The only ingredient of the proof which does not apply in the 
diffusion context is the classical Bernstein-inequality which can be replaced with the 
Bernstein-type inequality (3.1). Furthermore, by Cauchy-Schwarz and Poincare's 
inequality, 

Var M (Gt(A ff )) = (g,Ag)^ < \\g\\^Ugh,2 < \\Ag\\l 2 . 

Here and subsequently, < means less or equal up to some constant which does 
not depend on the variable parameters in the expression. Since tig (A -1 /, A~ l h) < 
II/— the bracketing entropy numbers with respect to dg, can be upper-bounded 
by i 2 (d/i)-bracketing. □ 

Remark. It is also possible to discretize the empirical process and to work with 
the discretized version, exploiting some mixing properties. In particular, the sym- 
metrization device can be applied after suitable decoupling to the discretized ver- 
sion, such that sufficient conditions ensuring asymptotic equicontinuity (such as 
Vapnik-Chervonenkis type conditions) can be derived. We do not pursue this strat- 
egy here but refer the reader to Rio (2000) for further results in this spirit. 

4. An intermediate process indexed by smoothed functions. Al- 
though the previous section reveals strong parallels, the work of van der Vaart and van Zantcn 
(2005) bares substantial differences in the regularity behavior of the empirical pro- 
cess of scalar diffusions as compared to the classical empirical process based on 
independent and identically distributed random variables. Indeed, they show for 
the empirical process of a one-dimensional diffusion with finite speed measure that 
pregaussianness already implies the Donsker property. The explanation lies in the 
Lebesgue continuity of the empirical measure due to the occupation times formula, 
i.e. the existence of diffusion local time. For higher dimension, diffusion local time 
does not exist and so this reasoning breaks down. 

However, mean-integrated squared error bounds for the estimation of the invariant 
density at a fixed point in Dalalyan and Reifi (2007) as compared to those for 
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probability density estimation based on independent and identically distributed 
random variables strongly suggest that there is some increased regularity also in 
the multidimensional diffusion case. Thus, one might hope that this effect is getting 
visible when studying some smoothed version of the empirical diffusion process 



(4.1) {&t,h(J))fer ■= (Vi J^f(x)w t , h (x)dx\ 
based on some kernel estimator TT t ,h of the invariant density 7r. 



For independent and identically distributed random variables X%, X n ~ P, where 
P is some probability measure on (R d , #(1^)) with Lebesgue density p, smoothed 
empirical processes (yfn J f(x)p n (x)dx)f e jr with some density estimator p n have 
been studied very recently, having regard to Donsker theorems under necessary 
and sufficient conditions. Starting from the hypothesis that J- is pregaussian, The- 
orem 3.2 in Gine and Zinn (1984) simplifies the problem of verifying asymptotic 
equicontinuity to the issue of proving some symmetrized analogue of (3.2), using 
the closeness of the density estimator to the density in the mean squared sense. 
This has led already in the one-dimensional case to a lower bandwidth bound 
nh^ oo, if p n is some kernel density estimator with bandwidth h n . Precisely, 
Radulovic and Wegkamp (2009) use the decomposition by the Cauchy-Schwarz in- 
equality, 



E* sup 
(4.2) 



n / f(x)(p n (x)-Ep n (x))dx 



(p n {x) - Ep n (x)) 2 \2 / [Vaxp n (x) 



<«• sup (v^(j ^ ^Y^j^r^ 

In case of the smoothed empirical diffusion process, this approach is not suitable 
because the constrained set J-g in Theorem 3.1 is defined by means of g?g which is 
only upper bounded by \\A- due to the Poincare inequality. Moreover, the above 
approach needs that the variance decreases to zero slightly faster than 1/y/n. Our 
proof for the smoothed empirical diffusion process is conceptually different. We do 
not use explicitly the closeness of 7?t, h to n in a mean squared sense. Remarkably, 
the performance of the smoothed empirical diffusion process we consider can be 
guaranteed even if the bandwidth is too small for ensuring consistency of T^t,h- 



Analyzing the intermediate process. In order to circumvent the problematic 
verification of the equicontinuity condition (3.2), we study as a preliminary object 
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the "empirical process indexed by smoothed functions" (R tt h(g))geG^ defined by 

1 f* 

(4.3) H tih ( S ) := G t {A (g * K h )) = -= / A(g*K h ) {X u )du, 

Vt Jo 

where Kh(x) := h~ d K{x/h) for some compactly supported kernel K on R d with 
J KdX — 1. The next proposition guarantees that the process is well-defined on 
any subspace of the domain Da which is locally invariant under translation. We 
formulate a slightly more precise statement. Note that the result does not need any 
further specification of A and in particular of Da in terms of Sobolev spaces with 
boundary conditions. 

Proposition 4.1. Assume that g{- + uh) G Da for h G [0, ho], u £ S^ 1 , such 
that \\g(- +uh)\\o A is uniformly bounded in h < ho,u 6 S^ 1 . Then g* Kh € Da for 
h € [0, hg], and the convolution is contained in the \\ ■ ||d ^-closure of \\K\\tv times 
the symmetric convex hull of {g(- — y) : \\y\\2 < ho}. 

Here, || • ||tv denotes the total variation norm. The proof of Proposition 4.1 is based 
on the subsequent lemma. 

Lemma 4.2. Let A satisfy the Poincare inequality. Then (Da, | ■ ||d a ) * s a sepa- 
rable Hilbert space. 

Proof. It is clear that (Da, || • ||d a ) i s pre-Hilbert. In order to prove completeness, 
l et (5n) neN be a Cauchy sequence in (Da, || • \\o A ). Then (.g™)„ eN and (Ag n ) neti are 
Cauchy sequences with respect to || • ||^,2- Completeness of L 2 (d/j,) implies that there 
exist some g such that \\g — flVill^ 2 — ^ an d some G such that \\G — Ag n \\ 2 — > 0. 
Since A is closed, it follows G — Ag, and, in particular, g 6 Da- It remains to prove 
separability. Note that £%a C £ 2 (d^) is separable as a subset of a separable metric 
space. Let {f n ) ne ^ be a dense subset of &a, and let (<7n) ngN be a dense subset in 
Da H Aa, where Na denotes the null-space of A which is a closed subset of L 2 (d[i), 
since A is closed. For any set S C Da, let A\ s denote the restriction of A to S. 
Then the set 

{ A \N^no A (/")) n£N U (-9«)neN 

is countably dense in (D^, || • ||d a )- For the proof, let g 6 D^ be arbitrary. Such g 
can be written as g = g + go for some g 1 - £ Da D Nj[ and some go E Na- It holds 
||^ X IU,2 = \\Ag\\n, 2 . Now let 

^ c {V*nD*(/») :neN } 

with — p ) || ju,2 — ^ 0. Poincare's inequality then gives \\gj: — #^11^,2 — > 0. 

Furthermore, let 5° C {#„ : n <E N} such that ||^ — g || 2 0- Thus, [g^r + g%) is 
the desired approximation. □ 
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Proof of Proposition 4.1. By Lemma 4.2, Da is closed under finite convex 
combinations and separable. That the convolution g*Kh is contained in the || • \\a A - 
closure of \\K ||tv times the symmetric convex hull of {<?(• — y) '■ \\yW2 < ho} now 
follows the lines of the proof of Lemma 1 in Gine and Nickl (2008), replacing L 2 (dQ) 
by (Da, || ■ ||nu)- As concerns the first statement of Proposition 4.1, it remains 
to note that the above || • \\n A -closure is again contained in Da because of the 
completeness of (Da, || • ||o A ) by Lemma 4.2. □ 

We now present the first result of this section. If not explicitly stated otherwise, 
K : M. d —> M. denotes subsequently some twice continuously differentiable kernel of 
compact support which admits the representation K^ x {z) = h~ d K(\\x — z\\2/h). 
Without loss of generality, we assume that the support of K is the closed d- 
dimensional unit ball Bq(1). Subsequently, (G(/))/ e jr denotes a centered Gaussian 
process with covariance structure 

cov(G(/),G( ff )) = - J ([A- 1 /] 5 +[A- 1 5 ]/)d/i, f,g£F. 

Theorem 4.3. Let ((X t ),P ^) be a stationary and ergodic Ito-Feller diffusion in 
E C M. d with non-empty interior E\dE , satisfying the conditions (I)-(V) of Section 
2. Assume that ^|^i nD4 -^ r * s a subset o/W 2 ' 2 (d/i) and possesses a fi-integrable 
envelope G of compact support C in the interior of E. Let 

hf :=t- l ' d \og{et) 

If J- is pregaussian, then 

(M tM (A-^ no j)) f ^ - (G(/)) /e ^ in t°°{F), 

provided that h t = h\ d ^ \ and h^ = 0(h t )■ 

PROOF. Step I. (Convergence of the finite-dimensional distributions) 

Recall the definition (4.3). Denote Q = ^^1^^ C Da n Nj[. Since X is an 

Ito-Feller diffusion, C\ C Da, and we have g * Kh £ Da for every g £ Q and 
h sufficiently small. Hence, J A(g * Kh)d/i — f Agdfi = 0. By Dynkin's formula, 
(Mf)t>o with Mf = g(X t ) — g(Xo) — f Ag(X u )du is a martingale, and letting 

(4.4) M B tht := g * K ht (X t ) - g * K ht (X Q ) - / A(g * K ht )(X u )du, 

Jo 

(t _1 / 2 M ( s /lt ( s )) 0<s<t i s a triangular array of martingales. For any natural number 
m, let gi, g m be some arbitrary collection of elements from Q . We will prove that 
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— M 91 

rt m t 



y/t 



op(l) as t — >• oo. 



The convergence of the finite-dimensional distributions then follows from the mar- 
tingale CLT. For note that by the assumption of the integrable envelope, 



e; sup 



1 



m ttht ( J g)--^M« ht 



(4.6) 
(4.7) 
(4.8) 



2 

Vt 

2 



G(z-y)\K ht \(y)dX(y)dfi(z) 



< —J I G{z-y)\K ht \{y)dX{y)dX{z) 



-j= / / G(x)dX(x)\K ht \(y)d\(y) 



1 

7T 



< — E M G(X ) 



as t — > oo, 



where we used assumption (V) in (4.6) and (4.8) and Fubini's Theorem and the 
translation invariance of the Lebesgue measure A in (4.7). Furthermore, 



max 

i— 1, m 



> £ 



< m max P, 



i(M« t (t)-Mf(t) 



m 

< max 

£ J i 



> £ 
2 



(4.9) 



rn 

'72 



max 2 (g i -g i *K ht )A(gi-g i *Kh t )d[x, 

= l,....m J 



where the expression for the variance in (4.9) is deduced in Bhattacharya (1982). 
As it appears useful for later purposes, we establish at this point the identity 



(4.10) 



2 / gAgd/j, 



(V w g)*a(-)V w gd/i 



for all g E W 2 ' 2 (d/i) of compact support in E\dE, with the matrix- valued function 
a(-) of the representation (2.1). Indeed, the identity (4.10) is known to be valid for 
arbitrary g G C\, because of E A1 A(g 2 ) = and (2.2). Let g e W 2 ' 2 (d^i) of compact 
support in the interior of E be arbitrary, and let <t>h{-) = h~ d (j){-/h) be a Dirac 
sequence, where <f> is assumed to be compactly supported and twice continuously 
differentiable. Thus, for sufficiently small h, g * (f>h € C\, and (4.10) holds true 
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for g * <j)h- Since the diffusion coefficient matrix is locally bounded, a is uniformly 
bounded on compacts coordinatewise, hence, 



(V(s- g* <f>h)) a(V(5 -g* fa))nd\ 
(4.11) = (V w g-(V w g)*fa) t a(V w g-('V w g)*fa)irdA 



= o(l) as ft \ 

since fa is a Dirac sequence (cf. Theorem 8.14 in Folland (1999)), where we used 
Lemma A.l in (4.11). As in Lemma A. 2, one can show that 



-2 / gAgdfi = lim -2E M (g * fa)A(g * fa), 

J h\0 



which proves the identity (4.10). 

Let || • ||f denote the Frobenius norm. Using now (4.10), Lemma A.l and the fact 
that sup y6C ||a(j/)7r(y)||jr is bounded, expression (4.9) is bounded by 



< 



(V w gi(x) - (V w gi) *K ht (x)) (V w gi(x) - {V w gi) * K ht (x)) d\(x) = o(l) 



as t — > oo, since (Kh t ) defines a (possibly not non- negatively valued) Dirac sequence. 
This proves finally (4.5). 



Step II. (Asymptotic equicontinuity) 
Since 

I 

— > as t — > oo 



E* sup 

g£G 



as shown in Step I, it is sufficient to prove the result for the triangular array of 
approximating martingales (t~ 1 ^ 2 M^ h ) g ^g. Define 

, ^/(t- 1/2 ML t -t- 1/2 Ml ht ) t 
\\M t . ht \\ da := sup . 

g,g£Q--d G (g,g)>0 a G(g,g) 

For any fixed K > and e > 0, 

1 

t— ¥oo \d G (g,g)<8 



lim sup P M ( _ sup | M a tM - M^ t | > e 



< limsup P M ( sup -1| M^ t - M t \ | > e; ||M t ,h t ||d 5: < if 
+ limsup P M (||M t)/l J do > if) 

t— >oo 

=:/ + /!, say. 
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We first treat expression /. It follows from Bernstein's inequality for continuous 
local martingales (see, e.g., Revuz and Yor (1999), p. 153) that 



1 



>e; l|Af tlfct |L <K)<2exp 



2K*d? G (g,g)J' 



that is, the random map g i-> ^jM^Uj ||M tj /j t < K} is subgaussian with 
respect to Kdt&. Hence by pregaussianness, lim^o lim sup^^ 1 = 0. 

We now study expression II. Recall the representation of A according to (2.1). By 
Ito's formula, 



(4.12) 



1 



< P"IU,i 



{{V w g)*K ht (X u )Ya(X u ) ((V w g) * K ht (X u )) dA(u) 
1 



{V w g)a{X u )(V w g)) * \K ht \{X u )d\{u) 



\\K\\ K i / (V w g(y)Y 



a(X u )\K ht \(y - X u )du (V w g) (y)d\(y), 



where we used Holder's inequality in (4.12). It remains to verify that 
fcWM? (7/0 a(X u )\K ht \(y - X u )du) (V w g(y)) d\(y) 



sup ■ 

see 



J E i^wg) a (V^g) 7rdA 



P (1). 



Let Xi, ...,xpf h be an /i t -net of C with respect to the Euclidean distance, whence 
Nh ~ h^ d by compactness of C. The uniform ellipticity condition and the fact that 
miy^c 77 (y) > imply that the last equation holds true whenever 



sup 



a{X u )\K ht \{y - X u )du 



sup sup 

k=l,...,N ht yeB^^ht) 



< sup 

k=l,...,N h 



a(X u )\K ht \{y - X u )du 
1 /"* 

- / a(X u ) sup \K ht \(y - X u )du 
t Jo yeB x ,(h t ) 



< sup 

k=l,...,N h 



y£B Xk {h t ) 
a(X u )h^ d t Bx (2h t )(X u )du 



(4.13) 



< sup 

k=l,...,N h 



a(X u )h t d l B r 2 h t )){ X u)du - I a{x)h t a l B ( 2 h t )( x k ~ x)iT(x)dx 



sup 

k=l,...,N h 



a(x)h t d U Bxt ( 2/lt )(x fc - x)ir(x)dx 



= Op(l). 
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But 



sup 



j ' a(x)h t d l By (2h t )(v - x)tt(x)>1x 



< sup \\a{y)\\ F . 
f ytc 

Hence, it remains to prove that expression (4.13) is bounded in probability. This 
turn out to be a crucial step in the proof. For this purpose, first empirical process 
techniques are employed which cover any dimension d > 2. One requisite is Propo- 
sition 1 in Dalalyan and Dalalyan and Reifi (2007), rephrased for our notation: 

PROPOSITION 4.4. Let C C E C R d be bounded and assume that /i < C\ on C 
for some positive constant C . Then, 

v 2 



Var, 



1 



VtJo 



S- d 1L By(S) (X u )du) < D'S-^XiByWycjiXiB^S))) 



with 



max{l,(log(l/:r)) 2 } if d = 2, 



where D' denotes some constant which depends on cp,Co,d and C only. 



First observe that 

(4-14) ( d (\(B x (h t ))) < UK)- 

For given fixed i, j E {1, . . . , d}, define for x g C and t, h > 0, 
(4.15) 

Ztt ■= ^(jj a^X^h^B^^iX^du^^a^X^h^B^^iXa) 

Now, (4.14) and Proposition 4.4 imply that Var M (Z%* t ) < sup^ eC |ay (z)\ 2 (j(hf ) 
for all x £ C and t > 0. With 



<4dx) := limVarp M (Z^) < sup \a %3 {z)\Xi{h< 1 ) 



zee 



and 



Coo,/! := sup\dij(z)h d 1 B .(2h)(z)\ < h d , 

zee 



Lezaud's (2001) Bernstein-type inequality yields the exponential tail bound 

u 2 /2 \ 



> u ) < 2 exp 



Vu > 0. 



Thus, the same decomposition as in the proof of Theorem 3.1 and application of 
Pisier's maximal inequality to each of the terms subsequently provide the bound 

(4.16) 



E 



It SU P /ri 



1 , if JlogfehT 1 ^ r 
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Turning back to the proof of Theorem 4.3, it suffices to note that expression (4.13) 
is bounded in probability whenever (h t ) is chosen as stated in the formulation of 
Theorem 4.3, due to (4.16). This finally completes the proof. □ 

Remark. Note that the condition y/nVa,r(p n (x)) — > which is required using the 
decomposition given in (4.2) is connected with the lower bound h n ~ n~ 1 / 2d on the 
bandwidth for the classical smoothed empirical process (the order of the variance is 
uniform over x) while in the present Theorem 4.3 the bandwidth h t ~ t~ x / d log(ei) 
is admissible. 

The efficient use of Pisier's inequality requires some kind of uniformity in the tail 
decay in t and h t , and it is the basic tool for the maximal inequality based on 
classical chaining techniques. In the present situation, the random variables Z x \ 
are very localized with increasing sharp maximum for h t decreasing to zero, i.e. 
with exploding supremum norm. Thus, although the variances of the increments of 
the empirical diffusion process are much smaller than those of the classical empirical 
process, it appears that the use of a refined maximal inequality by I2 — /oo-chaining 
does not lead to any tighter bound. The results obtained by the techniques of the 
proof of Theorem 4.3 are not optimal in the one-dimensional case which suggests 
that improvement is still possible by a different approach. Therefore, we revisit the 
bound on 



(4.17) sup 



a(X u )\K ht \(y - X u )du 







in a very specific two-dimensional case, employing results from fractal analysis of 
planar Brownian motion. 

Proposition 4.5. Let W denote some planar Brownian motion. Let ht \, with 
exp(-t) = 0(h t ). Then 

1 



limsupsup - / — ^-^ du = Op(l). 

t-s-oo x£C t Jq h t 

PROOF. Without loss of generality we assume C C B (l). By change of variables 
and the rescaling property of Brownian motion, it holds for any t > 1, h t < 1, 

f 1b.(m(W») a . f 1 lB A h t ){Wtu) 
sup / ^5— ■ -du = sup t / -^7- = -du 

\\x\\<iJq hf(\ log /i* I + flog*) \\ x \\<! J Q hf\loght - f logtj 



^ B x /Vt(ht/Vi) ft~ 1/2 Wiu) 

\M<Jo (h t /Vi) 2 \log(h t /Vi)\ 
r 1 t Bx (r){W u ) 



r<h t \\x\\<l JO 



-du, 
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and the latter term is bounded in probability - precisely, equals C + op(l) for some 
constant C > 0, due to Theorem 1.2, Dembo et al. (2001). In particular, 

1 f'kfMfW"), _ n /| log/Hi + I log* 



sup Z / Zi du = 0p ^ = Op(1) 

if (/it) is chosen as in the formulation of the proposition. □ 

Some well-known technique for proving results about diffusion processes is to reduce 
the problem via martingale representation theorems to standard Brownian motion. 
Thus, if the ergodic, stationary Ito-Feller diffusion can be represented via some 
diffconiorphism $ : M 2 — > M 2 from a local martingale in M. 2 satisfying the conditions 
(I) - (V), we may replace F by Fo $ and (Xt) by ($ _1 (X t )) in the definition of the 
empirical diffusion process, which makes it sufficient to investigate the intermediate 
process for the local martingale (3> _1 (X()). The aim of the next result is to give a 
hint on further potential improvement of the bound on (4.17) for some specific case 
of local martingales. 

Theorem 4.6. Let ((Xt),P fj,) be some isotropic, local martingale in R 2 which is 
representable as an analytic function of planar Brownian motion B with f 0, 
i.e. X = f{B). Let F(u) := = /„" \f'{B s )\ 2 ds and define K C ,t,r to be 

argmax{A({« G [0,F(t)] : \f(B F - Hu) )\~ 2 > «}) > Cr 2 {\ logr| + logF(t))} 
for any C > 0. Let h t \ satisfy 

(4.18) ^ j F{t) \f'{B)\- 2 l< ,d« = Op(1) 

hp Jq \\f ( b )\ f -i (u) > K c(i+o-p(i)),t,h t } 

as t — > oo and suppose that conditions (I) - (V) are satisfied. If 

possesses a fi-integrable envelope G of compact support C C R 2 and F is pregaus- 
sian, then 

(^aV.W))^ - TOW » 

provided that ht\,0 and exp(— t) = 0(h t ). 

Proof. The proof follows the lines of the proof of Theorem 4.3 until the bound on 
(4.17). The process (Xt) describes some isotropic, local martingale and possesses 
therefore the representation Xt — Bo(X^)t with a Dambis-Dubins-Schwarz Brow- 
nian motion on some standard extension of (Ft)- Due to its specific structure, the 
quadratic variation process can be written as (X^). — J„ |/'(B)| 2 ds. By change of 
variables, 
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Now, since 

rF(i 

\\x\\<l JQ 

by the proof of Proposition 4.5 



,F(t) 

sup / \ BAht) (B u )du < h 2 t (\\ogh t \+logF(t))(l + o P (l)) 

x\\<l JO 



1 [ {X{))t \-t>(n\\-2 ^B*{h t ){B u ) 



1 



F(t) 



nft J r w \\f (■ B )l P -i w >«c(i+ p(i»,t,j lt / 

for some constant C > 0, which proves the theorem. □ 



5. The analysis of the smoothed empirical process. The remarkable 
regularity properties of the process M. tt h clarify the influence of the regularity be- 
havior of the empirical measure for diffusions. Instead of following the guideline 
suggested by Theorem 3.1 for the analysis of St,h, it therefore appears useful to 
relate it to Mt,h under suitable smoothness assumptions on the coefficients of A. 
Before discussing the general approximation set-up, we present a simple condition 
ensuring the equality A(g * Kh) — (Ag) * Kh- 



Lemma 5.1. Assume that for some fixed h a > 

P t g(- -z)- g(-z) 



(5.1) 



lim sup 

t N.o zeBo (/i ) 



t 



- M- - z ) 



= o. 



M,2 



Then g * Kh € 3a, and A(g * Kh) = (Ag) * Kh /i-a.s. (h < ho). 



Proof. We will show that under the conditions of the lemma, (Ag) * Kh is the 
i 2 (d^)-limit of t^ 1 (P t (g * K h ) — g * K h ) as t \ 0. Using the Jensen inequality and 
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Fubini's Theorem, we have 

\\P t {g*K h )-g*K h ^ 
{Ag)*K h 



t 

/I 

E 



li,2 



M (i [E(g * K h (X t ) | X ) - g * K h (X Q )] - (Ag) * K h (X ) 



E M ( E( j [g * K h (X t ) - g * K h (X Q )] - {Ag) * K h (X ) 

, 2 

^0) 



X { 



< E M (i [g * K h {X t ) - g * K h (X )] - {Ag) * K h (X 

( J {\[g{X t -z)- g(X - z)} - (Ag){X - z)}K h {z)6zj 



' I|A'||a.,E ( , / \l[ g (X t -z)-g(X -z)] - (Ag)(X - z)} 2 \K h \{z)dz 



< \\K\H, sup EjUg(X t -z)-g(X -z))-{Ag){X Q - Z )) 2 

zeBo(ha) Kt 



□ 



If no additional information about Q is available, the applicability of Lemma 5.1 is 
very limited. Of course, the identity A{g * Kh) — {Ag) * Kh holds always true if the 
coefficients of A are constant in an open set which contains the support of g. Let 
/ : E — > R be a continuous function. Then its modulus of continuity on any convex 
set K C E is denoted by 

6{f,A;K) := sup \f(x)-f(y)\, < A < 1. 

x,y£K:\\x — y||2<A 

The next Theorem links the intermediate process M ty h to the smoothed empirical 
diffusion process St,h via regularity conditions on the local characteristics of the 
describing stochastic differential equation. 

Theorem 5.2. Grant the requirements of Theorem 4-3 on {{X t ), P^) and A~^ ±nn ^J- . 

Assume in addition that the function G of Theorem J^.3 is also envelope of the 
spaces of first and second order weak partial derivatives of elements of A^^^^T . 

Let h t — h[ \j such that h[ = 0{h[ d ^). Assume that the characteristics of the 
stochastic differential equation satisfy the subsequent conditions: 

(i) Vt maxjj 8{a i:j , h t ; C e ) = o(l). 
(ii) \/tma,Xi5{bi,ht',C E ) = o{i) and 

(Hi) \/tS{Tr 7 h t ; C e ) — o(l) for some e -neighborhood C E of C . 
Then, if J- is pregaussian, 



DONSKER THEOREMS FOR MULTIVARIATE DIFFUSIONS 21 



'gee 



o P (l) 



Proof. Recall the definitions (4.3) and (4.1), and let Q := A,„ A „ T . In view of 
Theorem 4.3 it is sufficient to prove 

(5-2) (^t,M) geg = &j H (Ag)) c 

First, 

Vt\E^(f*K ht )\ = Vt\^(f*K ht -f) 

= Vi \ / f(y)K ht (x - y)dyn(x)dx - I f(x)n(x)dx\ 



Vt\ f /(*/)(/ {-K{x)--K{y))K ht {x-y)dxy y 



< \\K\\ Xtl \\f\\ XA ViS(n,h t ;C e ). 

The uniform boundedness sup^gjr < oo follows from the continuity of a and b 

and therefore, their uniform boundedness on compacts, the existing envelope G on 
the weak partial derivatives in the expression of / = Ag as well as the equivalence 
of A ~ fi on the support of G. Furthermore, using the abbreviation 



D 2 w g 



d 2 



dxidxj 



<?(•) 



we obtain 



\gee Vt 



A(g * K ht ){X u ) - (Ag) * K ht (X u )di 



< ViE*Jsup\A(g*K ht )-(Ag)*K ht \) 

Vgee / 

< V [ sup { tv((a{y)-a{x)) t Dlg(y))K h {x-y)dy 

JE g€G J 



V I sup 

IE gdQ 



d[i*{x) 
d[i*(x) 



(b(y)-b(x)) V w g{y)K h (x-y)dy 

< V(max5(a ij ,h t ;C £ )+max6(b i ,h t -,C £ ))F,jG* \K ht \(X )) 

< Vt ( max 6(aij, h t \C e ) + max5(bi, h t ; C £ ))E AI G(X ), 



where the last < follows by the same reasoning as in (4.6) - (4.8). These findings 
entail (5.2). □ 

6. Example. We concretize our results for the following specific setting. Sup- 
pose that the diffusion is described by the stochastic differential equation 



(6.1) 



dX t = b(X t )dt + crdWt, 0<t<T, 
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with drift b : M. d — > M. d and d-dimensional Wiener process W. It is assumed subse- 
quently that there exists a potential V € C 1 (R d ) such that b = — VV. If the drift 
term b satisfies the at most linear growth condition ||6(x)||2 ^ (1 + IMI2), then 
equation (6.1) admits a unique weak solution (Karatzas and Shreve (1988), Propo- 
sition 3.6). If expo(— 2V) E L 1 (R d ,dX), there exists a unique invariant measure 
(Bhattacharya (1978), Theorem 3.5) which is Lebesgue continuous with invariant 
density 

tt(x) = ( I cxp(-2^(w))du) exp(-2V(x)), x € R rf , 

(Lorenzi and Bertoldi (2007), Theorem 8.1.26). Suppose further that the process 
starts in the equilibrium, i.e. X ~ /J,. 

For any convex set / C R d , let Hd(P, L; I) denote the isotropic Holder smoothness 
class, which for j3 < 1 equals 

%i{fi,m) := {</>:/^R: | <j>{x) - 4>{y) \ <L\\x-yf 2 }. 

Let L/3J denote the largest integer strictly smaller than (3. For j3 > 1, Hd{P,L;I) 
consists of all functions /:/—>• M that are [/3\ times continuously differentiable 
such that the following property is satisfied: if Py denotes the Taylor polynomial 
of / at the point y € I up to the |_/3J -th order, 

f{x)-P^\x) < L\\x-y\\$ ioraRx,y€l. 

Corollary 6.1. Let V e "H 2 (/3 + l,L;R 2 ) for some L > 0, (3 > 0. Suppose 
b = —W satisfies the at most linear growth condition, expo(— 2V) G L 1 (M. 2 ,dX) 
and 

(6.2) max max I d a bi(0) I < 7 

i=l,...,da:\a\<lfH 

for some 7 > 0. Let A~^ ±nn ^J- satisfy the requirement of Theorem 5.2, where 
C C E is convex. Let one of the following conditions be satisfied: 

(i) (X t ) = ($(M t )) for some isotropic local martingale (M t ) in M 2 satisfying the 
conditions of Theorem 4-6, and assume that there exist constants S, A > 
such that 

(6.3) B x {5r) C <S>-\B x {r)) C B x (Ar) Vie C, < r < 1. 

Suppose that h t := t~ n for some rj > max (1/(2/3), 1/2) satisfies condition 
(4. 18). 

(ii) Let h t ~ ^ 1/2 log(ei) and [3 > 1. 
Then 
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Proof. In case of (i), first note that \Zih% = o(l) for arbitrary to e [/3, 1]. Now, 
ViS(h,h t ;C e ) < 



(LVth^ for Pe (0,1], 

|sup Q: | Q | =1 su P;! , 6C \d a b t (x)\Vth t for (3 > 1 



Furthermore, V € Hiifi + 1, R 2 ) and (6.2) together imply that 7r is Holder con- 
tinuous of order (3 + 1 on every bounded set if C M 2 . Thus, since d a ir is continuous 
and hence, uniformly bounded on C e for all a € {0, l} 2 with |a| = 1, 



VtS(n,h t ;C E ) = \ft sup | ir(x) — ir(y) \ < V~t2 max sup \d a Tr(x)\h t — o(l). 

x,y£C e : 

\\ x -y\\2<h t 



*:|a|=lxeC e 

The result follows now from Theorem 5.2, using (6.3) and noting that 



(4.17) < sup 



t / lq,-i {By (h t ))(M u )du 



which allows the same estimate for this expression as used in Theorem 4.6. Similar 
considerations reveal claim (ii). □ 



REMARK. The case of dimension d = 2 takes an exposited place. Here, the possi- 
bility of smoothing via even exponentially small bandwidth, though in a very special 
case, can be possible. It is worth being noticed that the Holder regularity condi- 
tions on the local characteristics of the describing stochastic differential equation 
in Corollary 6.1 (i) are much weaker than what is typically imposed for ensuring 
existence and uniqueness of a strong solution. Note that no specific isotropy struc- 
ture of the diffusion is required in (ii) to allow ht ~ i -1 / 2 log(ef) which makes the 
result of Corollary 6.1 applicable in general whenever j3 > 1. 

Of course, imposing tighter conditions on the modulus of continuity of the lo- 
cal characteristics also enables to establish the approximation (5.2) for the d- 
dimensional case with d > 3, but these are not satisfied for a, b and tt with co- 
ordinates belonging to some smoothness class of the Holder type. While the result 
in Theorem 5.2 does not involve any further regularity constraint on the function 
class the subsequent Theorem benefits of combined smoothness of the local char- 
acteristics and the function class F at once. Recall that some function K : IR d — > R 
is called kernel of order / for some integer I > 0, if the functions u i-> u\K{u), 
j = 0, 1, / and i = 1, d, are integrable and satisfy 

J K(u)dX(u) = 1, J u{K(u)dX(u) = 0, j = l,...,l, i = l,...,d. 
Here, Ui denotes the i'th coordinate of u £ M. d . 

Theorem 6.2. Let V € Udifi + l,L;R d ) for some L > 0, /3 > d/2. Suppose 
b = — W satisfies the at most linear growth condition, expo(— 2V) € i 1 (M d ,dA) 
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and 
(6.4) 



max max I d a bi(0) I < 7 

i=l,...,da:\a\<[P\ 1 1 



for some 7 > 0. Assume that Q = A^^^^J 7 satisfies the requirement of Theorem 
5.2, where C C E is convex, Q C C 1 (M d ) and 

{d a g : S e5,H=l,ae {0, C - 1, L; C) 

for p > d/2. Let hf* := r 1 / d log(ef). Then 

(W/)W - (G(/)) /&F inf°(n 
provided that the involved kernel K is of order 2 1_/3 J — 1 . 



PROOF. The proof follows the lines of Theorem 5.2 with h t — hf\ involving how- 
ever different estimates for the expressions 

(6.5) Vt\EM*K ht )\ = Vt\ J E f(y)(J E (7r(x)-7r(y))K ht (x-y)dx)dy 
and 



EM supi 



A(g * K ht )(X u ) - (Ag) * K ht (X u )du 



(6.6) < sft f sup / {b{y)-b{x))\ w g{y)K ht {x-y)dy 

J E g£Q J 



respectively. V € Hd,(f3 + l,L;R d ) and (6.4) together imply that tt is Holder con- 
tinuous of order j3 + 1 on every bounded set D C K d . Since if is of order > d/2, 
Taylor expansion of n(y) around ir(x) in the inner integral of the right-hand side 
in (6.5) up to the \J3\ 'th order provides the bound 

Vt|E M (/*X- fct )| £ Sth% = o(l). 

Due to the absolute value involved in (6.6), a similar argument based on Fubini's 
Theorem as in (6.5) is not applicable, and we need to take advantage of additional 
smoothness of Vg as well. Let bi and (Vg)i denote the i'th coordinate of b and Vg, 
respectively. Let Py b '^ denote the Taylor polynomial of bi at the point y € R d up 
to the L/?J-th order, and similarly, Q^ 9 ^ denotes the Taylor polynomial of (V<7), 
at the point y € W l up to the order [f3\ — 1. Now, with the notation 



R$£ ■= h{y)-P b x '{y) and B%*< := (Vg)i(y) - Q ( ^(y), 
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we obtain 

(b(y)-b(x))\Vg)(y)K h (x-y)dy 



= E / ( p * (v) - &.-(*) + 4^) (0i V9)l (y) + Rg v B)i )Kh(x - y)dy 
»=i ^ 

(6.7) = £ / ^(Q^^ + ^^ftCx-^dy 

i= 1 

dl « 

(6.8) + J2 I (P*(y) - bi{x))R^K h {x - y)dy, 

i—l 

since K is of order 2[/3J — 1. As concerns the expression in (6.7), it remains to note 
that by the definition of H((3, L; C), 

sup \RM\ < L-hl sup \RZ 9h \ < L-ht 1 - 

x,y£C E : x,y£C e : 
\\x-y\\ 2 <h t \\x-y\\2<h t 

Furthermore, Qx Vs ^{y) is bounded uniformly in x,y, since all partial derivatives 
(V<?)i are bounded in absolute value by the envelope G and C £ is compact. There- 
fore, 

sup sup \Q^ g ^(y)\ < oo. 

see x.yt^Cl 

With regard to (6.8), 

sup \Pl i {y)-b i (x)\ <ht, while | flj^ 5 * | < L ■ /if -1 . 

Collecting these bounds yields sup sg g | J (b(y) — b{x]) t V w g{jj)Kh t {x— y)dy | < /if , 
that is, (6.6) = o(l). □ 



7. Discussion. In this article we analyze the empirical diffusion process in 

higher dimension. One motivation for our study was the observation due to van der Vaart and van Zanten 
(2005) that it is possible to prove uniform central limit theorems for empirical pro- 
cesses of scalar regular diffusions with finite speed measure under only pregaussian 
conditions. This remarkable result reflects the increased regularity of empirical pro- 
cesses of scalar diffusions due to the existence of local time, and the analysis of 
local time is an integral part of its proof. In higher dimensions, diffusion local time 
does not exist, and therefore it was not clear at all how to derive Donsker theorems 
under necessary and sufficient conditions. The question considered in this article 
is how to bring out some potentially increased regularity of empirical processes of 
multidimensional diffusions as compared to classical empirical processes based on 
iid M d -valued random variables. 
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We start by showing that there exist strong parallels to classical empirical process 
theory. If (the carre du champ of) the diffusion satisfies Poincare's inequality, then 
the classical condition of a finite bracketing entropy integral due to Ossiander can 
be used for proving Donsker theorems. Replacing the symmetrization device by an 
application of the generic chaining bound, it is also possible to deduce an analogue 
of Theorem 3.2 in Gine and Zinn (1984) which describes the effect of pregaussian- 
ness on the asymptotic equicontinuity criterion. This result simplifies the problem 
of verifying asymptotic equicontinuity of the empirical diffusion process in the pre- 
gaussian setting as it shows that it suffices to consider the supremum over balls 
with radius d<s(f,g) < 5 with 5 — (rj/Vt) 1 ^ 2 for some r\ > 0. One subtle difference 
to the case of classical empirical processes is however that the constraint is formu- 
lated in terms of the metric dc instead of a constraint in terms of || • ||^,2- Since 
Poincare's inequality merely yields an upper bound on d& in terms of \\A ■ ||^,2, 
it is not possible to use a decomposition based on the Cauchy-Schwarz inequality 
as typically done in the classical situation for verifying the simplified asymptotic 
equicontinuity criterion. 

However, it turned out that some modified version, the intermediate process in- 
dexed by smoothed functions, which does not appear naturally in the investigation 
first has remarkable regularity properties. Concerning the proof of this behavior, 
there are two crucial points. First, we do not rely mainly on classical empirical 
process theory but use tools from stochastic analysis such as martingale approxi- 
mation relating the empirical diffusion process to the generator of the associated 
Markovian semigroup. In particular, an essential part of the proof is the subgaus- 
sian exponential inequality for continuous martingales. This allows to reduce the 
proof of uniform weak convergence to deriving an upper bound on the expectation 
of the supremum of some stochastic process which is typically done by means of 
the chaining technique. Here again we find substantial differences as compared to 
the classical situation for the empirical process based on iid random variables; see 
the discussion below the proof of Theorem 4.3. The outstanding regularity of the 
intermediate process indexed by smoothed functions in dimension d — 2 is only 
detected by using very sharp results about the occupation measure of planar Brow- 
nian motion. Similar improvement of this type for d > 3, though less substantial, 
can presumably be derived again by fractal analysis of multidimensional diffusions. 

Furthermore, we would like to stress that for any dimension d, already the result 
of Theorem 4.3 shows a remarkable improvement on the restriction concerning the 
lower bound on the admissible bandwidth as compared to the smoothed empirical 
process based on independent and identically distributed random variables in R d ; 
see the remark on p. 17 and p. 3 of the Introduction. 

APPENDIX 

We collect some elementary functional analytic requisites for our proofs, which 
might be well-known, yet we did not find them anywhere explicitly stated in the 
present form. 
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Lemma A.l. Assume that g £ W 1,2 (d/x) and Kh is symmetric. Then it holds 
fi-a.s. 

d5(g*K h ) = (d°g)*K h for all a € {0,l} d with \a\ = 1. 

Proof. By definition of the weak derivative, it holds for all 4> & C^, 
d a (g * K h )(x)cj)(x)d\(x) = - [ (g * K h ) (x)d a J>(x)d\(x) 

g(y)K h (x - y)dy d a <j>(x)d\(x) 
d a <j>{x)K h {x-y)dxg{y)d\{y) 
= - / (d a ct>*K h )(y)g(y)d\(y) 
d a (0 * K h ) (y)g(y)dX(y) 
(A.l) = I dZg(y)^*K h )(y)d\(y) 



(dZg)*K h (x)<j>(x)d\(x), 

where the identity d a ((f> * Kh) = {d a 4>) * Kh in (A.l) is proved, for instance, in 
Lemma 5 a) in Ginc and Nickl (2008). □ 

Lemma A. 2. Let A be the generator of an Ito-Feller diffusion in E C M. d with con- 
tinuous drift and diffusion coefficient, b and a, respectively. Then any g £ W 2 ' 2 (d/i) 
of compact support in E\ dE belongs to ID) a, and 

d 



I, J — 1 l—l 

PROOF. Let g e W 2 ^ 2 (d^) be of compact support in E\dE. Let <f>h(-) = h~ d 4>(-/h) 
be a Dirac sequence with some twice continuously differentiable, symmetric kernel 
</>. Then g * <f>h is twice continuously differentiable and of compact support, hence 
g * 4>h £ Dyi for sufficiently small h, and it holds by Lemma A.l that 

dZ{g*<t>h) = {dZg)*(j>h for all multi-indices a £ {0,1, 2} d with |a| < 2. 

Therefore, 

1 . ,^ ( d l9 . A \ , ,^ ( d w g 



- - i \ Oyj X% 0<w Xj J . \ O'ujXi J 
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Since <j>h defines a Dirac sequence, — g*^|| 2 — >-0as/i\0 (cf. Theorem 8.14 
in Folland (1999)). Let 

G( X ) := iE^W^^W + Ewlf («)• 

2J — 1 J 2 — 1 

Denote the union of the supports of g and g * </>/j by . Then 

d 

||A( 5 *^)-G|| M>2 < 2 E W^cJ 

d 

+ ElMcJI 

1=1 

— ► as ft \ 0, 

because (</>/j) is a Dirac sequence, g G W 2,2 (d^t) and a(-) and b(-) are continuous, 
hence uniformly bounded on a decreasing sequence of compacts. But this implies 
G = Ag, since A is closed. □ 



dig 



* <Ph 



dig 



fj.,2 



d w g 



* <Ph 



d w g 



H,2 
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